Ensemble Validation: Selectivity has a Price, but Variety is Free

Authors

  • Eric Bax
  • Farshad Kooti
Abstract

If classifiers are selected from a hypothesis class to form an ensemble, bounds on average error rate over the selected classifiers include a component for selectivity, which grows as the fraction of hypothesis classifiers selected for the ensemble shrinks, and a component for variety, which grows with the size of the hypothesis class or in-sample data set. We show that the component for selectivity asymptotically dominates the component for variety, meaning that variety is essentially free.

Similar resources

arXiv:1610.01234v1 [stat.ML] 4 Oct 2016. Ensemble Validation: Selectivity has a Price, but Variety is Free

If classifiers are selected from a hypothesis class to form an ensemble, bounds on average error rate over the selected classifiers include a component for selectivity, which grows as the fraction of hypothesis classifiers selected for the ensemble shrinks, and a component for variety, which grows with the size of the hypothesis class or in-sample data set. We show that the component for select...

Ensemble strategies to build neural network to facilitate decision making

There are three major strategies to form neural network ensembles. The simplest one is the Cross Validation strategy, in which all members are trained with the same training data. Bagging and boosting strategies produce perturbed samples from the training data. This paper provides an ideal model based on two important factors: activation function and number of neurons in the hidden layer and based u...

Validation of Synoptic Station Data Using Ensemble Classification on Central Iran

Today, data recorded at the country's synoptic stations are among the most significant sources for applied research. Data recorded automatically or manually at synoptic, climatological, and other stations are used for statistical analysis. In this research, the data recorded in the synoptic stations of Iran, which are used to determine the days of dust, were analy...

The Impact of Oil and Gold Prices’ Shock on Tehran Stock Exchange: A Copula Approach

Several studies deal with the behavior of SEs and their relationships with different economic factors. These range from papers that approach the subject through econometric procedures to those that use statistical methods known as copulas. This article considers the impact of oil and gold prices on the Tehran Stock Exchange (TSE) market. Oil and gold are two factors that are essential for the ...

Statistical properties of daily ensemble variables in the Chinese stock markets

We study dynamical behavior of the Chinese stock markets by investigating the statistical properties of daily ensemble returns and varieties defined respectively as the mean and the standard deviation of the ensemble daily price returns of a portfolio of stocks traded in China’s stock markets on a given day. The distribution of the daily ensemble returns has an exponential form in the center an...

Journal:
  • CoRR

Volume abs/1610.01234

Publication date 2016